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Searches for B mesons decaying to final states containing a baryon and a lepton are performed, 
where the baryon is either A c or A and the lepton is a muon or an electron. These decays violate 
both baryon and lepton number and would be a signature of physics beyond the standard model. 
No significant signal is observed in any of the decay modes, and upper limits in the range (3.2 — 
520) x 10 -8 are set on the branching fractions at the 90% confidence level. 

PACS numbers: 13.25.Hw, 11.30.Fs, 14.80.Sv 



I. INTRODUCTION 

Observations show that the universe contains much 
more matter than antimatter [l|, This suggests 

that there are processes that violate CP-symmetry and 
baryon- number conservation However, experimen- 
tally observed CP violation, combined with the baryon- 
number violating processes that are allowed by the stan- 
dard model cannot explain the observed matter- 
antimatter asymmetry. 

Baryon-numbcr violation is a prediction of many uni- 
fication theories d, [f|, but the proton decay rates pre- 
dicted by many of these models have not been observed. 
Stringent limits have been placed on the lifetime of the 
proton Q. The non-observation of proton decay has 
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been used to constrain baryon- and lepton-number vio- 
lating decays involving higher-generation quarks and lep- 
tons [8|; in that study, the upper limit on the branching 
fraction for B° — > A+l~ is calculated to be 4 x 10 -29 , 
where I is a lepton. No upper limits are calculated for 
B~ decays to Al~ or At" . 

We report the results of searches for the decays B° — » 
A+£~, B~ -> At", and B~ ->• At", where the lepton is 
a muon or an electron Q. Neither lepton number nor 
baryon number are conserved in these decays. This is 
the first measurement of the branching fractions for these 
decays. An observation of any of these decay processes 
would be a sign of new physics. 



II. THE BABAR DETECTOR AND DATASET 

The data used in this analysis were recorded with the 
BABAR detector at the PEP-II asymmetric energy e + e~ 
storage ring. The data sample consists of 429.0 fb _1 
recorded at the T(4S) resonance (y / i=10.58 GeV/c 2 , 
where \J~s is the center-of-mass (CM) energy of the e + e~ 
system). The sample contains (471 ± 3) x 10 6 BB pairs. 

The BABAR detector is described in detail else- 
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where [lO|. Charged particle momenta are measured in 
a tracking system consisting of a five-layer, double-sided 
silicon vertex tracker (SVT) and a 40-layer central drift 
chamber (DCH), immersed in a 1.5 T axial magnetic 
field. Photon and electron energies are measured in a 
CsI(Tl) electromagnetic calorimeter (EMC). The mag- 
netic flux return for the solenoid (IFR), instrumented 
with resistive plate chambers or limited streamer tubes, 
provides muon identification. Charged particle identifi- 
cation (PID) is also provided by an internally reflecting 
ring-imaging Cherenkov detector (DIRC) and the energy 
loss dE/dx measured by the SVT and DCH. Information 
from all these detectors is used in the particle identifica- 
tion. 

Simulated Monte Carlo (MC) events are generated to 
study detector effects. The detector response is mod- 
eled using the GEANT4 software package 11] . Large 



numbers of signal events are generated for the six decay 
modes, assuming that the B meson decays do not pro- 
duce any preferred polarization of the A£ or A. This sam- 
ple is referred to as the signal MC. For background stud- 
ies, a large sample of BB events is produced, with the 
B mesons decaying according to the measured branch- 
ing fractions 12|. The same procedure is used to gener- 
ate background samples for e + e~ annihilation to lighter 
quark-antiquark pairs (u,d,s,c). These two samples are 
referred to as background MC. 



III. OVERVIEW OF ANALYSIS 

We identify B-meson candidates using two kinematic 
variables: the difference between half the CM energy 
of the colliding beams and the measured CM energy of 
the B candidate, AE; and the energy-substituted mass 
Toes of the B candidate. In the calculation of mES, the 
precise knowledge of the initial state energy is used to 
improve the resolution on the calculated mass of the B 
candidate: toes = \/[(s/2 + pi ■ pB)/Ei} 2 — |ps| 2 , where 
(Ei,pi) and ps are respectively the four- momentum of 
the e + e~ system and the three-momentum of the B- 
meson candidate in the laboratory frame. A region of 
phase space in these two variables, in which fits will 
be performed to extract the signal yield, is defined by 
the ranges -0.2 < AE < 0.2 GeV and 5.2 < m ES < 
5.3GeV/c 2 . This is referred to as the fitting region. e+e 



The signal for true B candidates for the studied de- 
cays is centered around AE = (<r ~ 16MeV) and 
toes = 5.279 GeV/c 2 (a « 3MeV/c 2 ), where a is the ex- 
perimental resolution. 

In any search for a rare or new process, it is impor- 
tant to minimize experimenter's bias. To do this, a blind 
analysis is performed. The kinematic region of phase 
space that would be populated by true signal events is 
hidden during optimization of the candidate selection cri- 
teria. We exclude events within roughly ±5cr of the signal 
peak in toes and ±4cr in AE. The non-blinded region 
is referred to as the sideband region. We define a region 



within ±2.5cr of the signal peak in toes and AE as the 
signal region. 

The signal yield is extracted with an unbinned ex- 
tended maximum likelihood fit. The total probability 
distribution function (PDF) is a sum of PDFs for sig- 
nal and background. Each of these PDFs is a product 
of PDFs describing the dependence on toes and AE. 
For the A+l~~ modes, additional discriminating power is 
gained from a three-dimensional PDF, where the output 
from a neural net discriminator is used as the third vari- 
able. This discriminator is defined in the next section. 



IV. CANDIDATE SELECTION AND 
OPTIMIZATION 

A£ candidates are reconstructed through the decay 
mode A+ — > pK~n + , which has a branching fraction 
of (5.0 ± 1.3) x 10" 2 0- Other studies of B decays to 
A+ [l3[ show that including additional /Lt decay modes 
would add little sensitivity to this analysis. A candidates 
are reconstructed through the decay A — > p7r~, which 
has a branching fraction of (63.9 ± 0.5) x 10~ 2 0. The 
final state tracks for both the A+ and A decays are con- 
strained to a common spatial vertex and their invariant 
mass is constrained to the A£ or A mass 0. This has 
the effect of improving the four-momentum resolution for 
true B —> Ai c \i candidates. 

_B-meson candidates are formed by combining a A+ , A 
or A candidate with a fi~ or e~. The baryon and lep- 
ton candidates are constrained to originate from a com- 
mon point. The final state hadron (p,K,ir) and lepton 
(/i, e) candidates are all required to be consistent with 
the candidate particle hypothesis according to PID algo- 
rithms that use dE/dx, DIRC, EMC and IFR informa- 
tion. The four-momenta of photons that are consistent 
with bremsstrahlung radiation from the electron candi- 
date are added to that of the electron. As the A has 
ct = 7.89 cm, the purity of the yl-candidate sample is 
improved by selecting candidates for which the recon- 
structed decay point of the A candidate is at least 0.2 cm 
from the reconstructed decay point of the B candidate in 
the plane perpendicular to the e + e~ beams. 

A non-negligible background for the Al channel is due 
to incorrect identification of electrons and positrons in 
+ — ^ e + e~7 events in which the photon converts into 



an additional e + e~ pair and an electron and positron 
in the final state are misidcntified as a t" and proton 
coming from a A decay. This background is almost en- 
tirely eliminated by requiring that there are more than 
four charged tracks in the event. We apply this selection 
criterion to all channels. 

Candidate selection optimization is performed balanc- 
ing two goals: setting the lowest upper limit while re- 
maining sensitive to a signal. We use the Punzi figure of 
merit (FOM) Q, e/(o/2 + ^iV bkg ), where e is the sig- 
nal efficiency, -/Vbkg is the expected number of background 
events, and a is the number of standard deviations of sig- 
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nificance at which the analysts would claim a discovery. 
For this analysis, a — 5 is used. The signal efficiency and 
the expected number of background events are obtained 
from the respective MC samples. 

For optimization of the p, K, it candidate PID selec- 
tion, we calculate the Punzi FOM by estimating e and 
-/Vbkg from the baryon candidate invariant mass distri- 
bution. The background is assumed to be linear in the 
baryon invariant mass and a fit is made to extract the 
number of signal and background candidates. After the 
PID selection is optimized, we select candidates within 
±15 MeV/c 2 of the nominal A+ mass and ±4 MeV/c 2 of 
the nominal A mass 

The lepton candidate selection is optimized based on 
the number of B — > Ai c ^t candidates in the signal region 
in MC samples. These B candidates contain correctly 
identified leptons from the signal decay, which define e in 
the Punzi FOM, and two types of background, which de- 
termine ./Vbkg: correctly identified leptons from standard 
model processes, and incorrectly identified leptons. 

A neural net is used to provide further discrimination 
between signal and background. We use the TMVA soft- 
ware package [l5| and its multilayer perceptron imple- 
mentation of a neural net. The neural net is trained us- 
ing MC simulated samples for signal and for e + e~ — » qq 
(q = u,d,s,c). The six discriminating variables, all de- 
fined in the CM frame of the e + e~ beams, used in the 
neural net algorithm are the angle between the B me- 
son momentum and the axis defined by the colliding 
e + e _ system, the angle between the B meson candidate 
sphericity [16( axis and the sphericity axis defined by the 
charged particles in the rest of the event (ROE) , the angle 
between the B meson candidate thrust [16| axis and the 
thrust axis defined by the charged particles in the ROE, 
the ratio of the 2 nd to th Fox- Wolfram moment [13] cal- 
culated from the entire event using both charged and 
neutral particles, X-moments of the ROE tracks 18j, and 
the magnitude of the thrust of the entire event. For the 
At modes, the ratio of the Fox- Wolfram moments and 
the magnitude of the thrust of the entire event show a 
slight correlation with AE and ibes in the background 
sample and are therefore not used. 

For the A+t~ decay modes, we retain events with a 
value of the neural net output above a threshold such 
that about 90% of the signal is retained and about 50% 
of the background is rejected. The neural net output 
for the retained events is used as a third discriminating 
variable in the PDF used in the fit. 

The At modes have significantly less background than 
the A+t~ modes. For the At modes, we retain events 
with a value of the neural net output above a threshold 
(optimized using the Punzi FOM) and perform a fit in 
AE and Toes only. 

After the optimized selection criteria are applied, the 
remaining background for the Ajt~ modes is composed 
of roughly equal amounts of BB and qq (q — u, d, s, c) 
events, while the background for the At modes is almost 
entirely qq. 



V. EXTRACTION OF RESULTS 

As stated earlier, the signal yield is extracted with an 
unbinned extended maximum likelihood fit. For all decay 
modes, the signal toes PDF is modeled as a Crystal Ball 
function [l9| . which has three free parameters. The signal 
AE PDF is the sum of two Crystal Ball functions with 
the same mean. For the A+l~ decay modes, the signal 
neural net output is modeled by a non-parametric PDF 
implemented in the RooFit [20] package that models the 
distribution as a superposition of Gaussian kernels plj ]. 
The full signal PDF is a product of these PDFs. Signal 
MC samples for each decay mode are used to determine 
the parameter values for these functions, and these values 
are fixed in the fit to the data. 

For all decay modes, the background toes PDF is mod- 
eled as an ARGUS function [22j and the background AE 
PDF is modeled as a linear function. The unnormalized 
ARGUS function is defined as "J (to) = mu p e cu , where 
p = 0.5, u = 1 — (m/TOo) 2 , c is the curvature parameter, 
and too is the kinematic cutoff above which the function 
is defined to be 0. We determine to = 5.290 GeV/c 2 by 
a fit to the background MC events and fix this value in 
the fit to the data. For the A+t~~ decay modes, the back- 
ground neural net output PDF is modeled as a Crystal 
Ball function. 

In the fit, the number of background events is a free 
parameter and the number of signal events S is the prod- 
uct of the branching fraction B, which is treated as a 
free parameter, and a conversion factor T: S = BT, 
where T = e$A (c) -/Vb , &a Lc) is the branching fraction for 
the A+ or A, and Nb is the number of either neutral 
or charged B mesons in the dataset; Nb = 2N T ^ 4:S - ) B B g, 
where Nx(as) is the number of T(4S) in the dataset, B B g 
is the branching fraction for the T(4S) to decay to either 
a neutral or charged BB pair, and the factor of 2 ac- 
counts for the pair of B mesons produced in each T(45) 
decay. There are no other free parameters for the signal 
PDF. The two-dimensional background PDF for the At 
modes has two free parameters (the AE slope and the 
toes ARGUS shape parameter); the three-dimensional 
background PDF for the A+t~ modes has three addi- 
tional free parameters for the Crystal Ball function that 
models the neural net output. 

In order to incorporate systematic uncertainties on T 
(discussed below) directly in the fit and propagate them 
to the total uncertainty on the branching fraction, a 
Gaussian constraint is included as a term (Q) in the ln- 
likelihood function (ln£): Q = (J 7 — J r fi t ) 2 /2J 2 r , where T 
is the value we calculate for the conversion factor, is 
a free parameter, and Sjr is the uncertainty on the con- 
version factor. The Gaussian constraint is turned off in a 
subsequent fit to extract the statistical uncertainty only. 
This error is then subtracted in quadrature from the total 
error to determine the systematic error on the branching 
fraction from these sources. 

In order to test the stability and sensitivity of the fit- 
ting procedure as well as to search for possible bias in 
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the fit, simulated signal events are embedded in a sam- 
ple of background events generated from the background 
PDFs using MC techniques. We generate many indepen- 
dent samples with varying ratios of the number of em- 
bedded signal events to the number of background events 
in order to model different branching fractions. These 
samples are fit and the extracted branching fractions are 
compared with the branching fractions used to determine 
the amount of embedded signal events. Biases of no more 
than 20% of the statistical uncertainty on the result of an 
individual fit are observed, depending on the decay mode 
and the number of signal events. In addition, for B = 
fits, we observe that 0.1% to 1.5% of fits (depending on 
decay mode) have no candidates in the signal region, and 
are thus unable to constrain the signal parameters. To 
avoid unmathematical negative PDF values [23[ , we refit 
these cases in the data with a constraint that the PDF 
must be positive throughout the fitting region. 

Systematic uncertainties are due to uncertainties on 
A and A£ branching fractions, the total number of 
B mesons produced during the experiment's lifetime, 
and the tracking and PID efficiencies, which arc deter- 
mined from control samples in data. We use the mea- 
sured branching fractions and associated uncertainties 
for r(45) B+B- and 7(45) ->■ B°B°, which are 
(51.6 ± 0.6) x 10~ 2 and (48.4 ± 0.6) x 10~ 2 , respec- 
tively For the A+i~ mode, the systematic un- 
certainty is dominated by the 26% uncertainty on the 
Aj — > pK~Tr + branching fraction; the other uncertain- 
ties contribute about 3%. We do not assign any system- 
atic uncertainty due to the assumption of an unpolar- 
ized final state. Systematic uncertainties from the fixed 
PDF parameters are considered to be negligible. The to- 
tal systematic uncertainties are estimated to be 26% for 
B° ->■ A+£-, 3.0% for B~ -> Afi~ and B~ -> Afi~, and 
2.5% for B- -> AeT and B~ -> Ae~ . 

The data and the fit projections are shown in Figs.[TJ[2j 
and M for B° -> A+£~, B~ -> Ai~, and B~ -> Ai~, 
respectively. 
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FIG. 1. Data for all events in the fitting region with overlaid 
fit results for B° — > A^l~ candidates. The left column is for 
the muon mode and the right column the electron mode. Dis- 
tributions of (a,b) mES, ( c ,d) AE, and (e,f) neural net (NN) 
output are shown. Dashed lines represent the background 
components of the fit and solid lines represent the sum of the 
signal and background components. The lack of a significant 
signal makes the solid and dashed lines indistinguishable in 
some plots. In subfigure (e) and (f), the vertical dashed line 
indicates the selection criteria on the neural net output. 



VI. RESULTS 

No significant signal is observed and an upper limit 
is calculated for the branching fraction for each decay 
mode. To calculate the upper limit, the branching frac- 
tion is varied around the best fit value Bhcst and the other 
parameters arc refit to map out the difference in the ln- 
likelihood: Aln£ = ln£(Bbest) — In £(£?). We integrate 
the function y — e ~ Aln£ over B. While the fit allows 
the branching fraction to assume negative values, we ig- 
nore the unphysical region with B < and calculate the 
integral for B > 0. We determine the value of the branch- 
ing fraction B 90 % for which 90% of the area lies between 
B = and /Bgo% and interpret this as the upper limit at 
90% confidence level. The results from the fit are given in 
Table |H Since the biases observed in Monte Carlo studies 
are small compared to the statistical uncertainties, we do 



not include their effect in the results. For the Ae~ decay 
mode, there are no candidates in the signal region. The 
fitted branching fraction for this decay mode is equal to 
the limit in the fit determined by the requirement that 
the PDF be positive throughout the fitting region. 



VII. SUMMARY 

Searches are performed for the decays B° A+£~, 
B~ -> A£~ and B~ ->• A£~ , using the full BaBAR data 
set. No significant signal for any of the decay modes 
is observed and upper limits are determined at the 90% 
confidence level. 

We are grateful for the excellent luminosity and ma- 
chine conditions provided by our PEP-II colleagues, and 



9 



a) B-> A \i 




-B.2 -0.1 o 0.1 
A E (GeV) 



: 30 



; 20 



S 10 



^ 30 

o 



20 



10 



b)£^Al 




5.22 5.24 5.26 5.28 

m ES (GeV/c 2 ) 
d) A e 




-0.1 0.1 
A E (GeV) 



0.2 



> ' 
ID 

C3 



v in i 





a) B— > A li" 



















5.22 5.24 5.26 5.28 

m ES (GeV/c 2 ) 



C) 5^ A u" 




-10.2 -0.1 0.1 

A E (GeV) 



30- 



! 20- 



5 30- 

O 

o 

5 20- 



b) B-> A e 




5.22 5.24 5.26 5.28 

m ES (GeV/c 2 ) 
d) B— > A e 



i.2 -0.1 0.1 
A E (GeV) 



FIG. 2. Data for all events in the fitting region with overlaid 
fit results for B~ — > Al~ candidates. The left column is for 
the muon mode and the right column the electron mode. Dis- 
tributions of (a,b) mES and (c,d) AE are shown. Dashed lines 
represent the background components of the fit and solid lines 
represent the sum of the signal and background components. 
The lack of a significant signal makes the solid and dashed 
lines indistinguishable in some plots. 



FIG. 3. Data for all events in the fitting region with overlaid 
fit results for B~ — > Al~ candidates. The left column is for 
the muon mode and the right column the electron mode. Dis- 
tributions of (a,b) mES and (c,d) AE are shown. Dashed lines 
represent the background components of the fit and solid lines 
represent the sum of the signal and background components. 
The lack of a significant signal makes the solid and dashed 
lines indistinguishable in some plots. 



TABLE I. Total number of candidates used in the fit (A?" can d), 
the central value for the branching fraction returned by the fit 
(B), signal efficiency (e) excluding the contribution from the 
yl( c ) branching fraction, and upper limits on the branching 
fraction at 90% confidence level (£>g %) for each decay mode. 
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